KMID : 0917520080150040097
Journal of Speech Sciences, 2008, Vol. 15, No. 4, pp. 97-105
Control of Duration Model Parameters in HMM-based Korean Speech Synthesis
Kim Il-Hwan
Bae-Keun Seung
Abstract
HMM-based text-to-speech synthesis (HTS) has been widely studied because it requires less memory and computation than corpus-based unit-concatenation text-to-speech synthesis, which makes it suitable for embedded systems. It also has the advantage that the voice characteristics and speaking rate of the synthetic speech can be changed easily by modifying the HMM parameters appropriately. We implemented an HMM-based Korean text-to-speech system using a small Korean speech database and propose a method for increasing the naturalness of the synthetic speech by controlling the duration model parameters of that system. We performed a paired comparison test to verify that these techniques are effective. The resulting preference score of 73.8% shows that controlling the duration model parameters improves the naturalness of the synthetic speech.
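The abstract does not spell out how the duration model parameters are controlled, so the following is only a minimal sketch of the state-duration rule commonly used in HMM-based synthesis, not necessarily the method proposed in the paper. It assumes each state has a Gaussian duration model with mean m_k and variance v_k, and sets the duration to d_k = m_k + rho * v_k, where a single factor rho speeds up or slows down the whole utterance. The function names and the example values are hypothetical.

```python
def state_durations(means, variances, rho=0.0):
    """Frames per state under d_k = m_k + rho * v_k (at least 1 frame each).

    rho = 0 gives the mean durations, rho > 0 slows speech down,
    and rho < 0 speeds it up. States with larger variance change more.
    """
    return [max(1, round(m + rho * v)) for m, v in zip(means, variances)]


def rho_for_total_length(means, variances, total_frames):
    """Solve sum_k (m_k + rho * v_k) = total_frames for rho."""
    return (total_frames - sum(means)) / sum(variances)


# Hypothetical duration model for three states (values in frames).
MEANS = [4.0, 10.0, 6.0]
VARIANCES = [1.0, 4.0, 3.0]

# Stretch the 20-frame mean-length utterance to 28 frames.
slow_rho = rho_for_total_length(MEANS, VARIANCES, 28)
slow = state_durations(MEANS, VARIANCES, slow_rho)

# Compress the same utterance to 12 frames.
fast_rho = rho_for_total_length(MEANS, VARIANCES, 12)
fast = state_durations(MEANS, VARIANCES, fast_rho)
```

Because the adjustment is scaled by the variance, states whose durations vary widely in the training data absorb most of the change, while states with stable durations stay close to their means.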
Keywords
HMM, speech synthesis, HTS, state-duration model